University of Maryland, College Park
flu sick fever cough sweat ache pain fatigue nausea vomitting diarrhea lymph
We overlaid a heatmap of the remaining 42275 tweets on the Vastopolis map which highlighted areas of Downtown in red. We then divided the tweets by day and looked at daily heatmaps. Daily analysis revealed a spike in the number of 'sick' tweets on 18th of May. Looking at hourly tweet counts on May 18th-20th, we observed that tweet counts increased significanly on May 18th at 8a.m. (see Fig. 1). The heatmap analysis for May 18th confirms that disease started spreading on that day from the Downtown area (see Fig. 2). In particular, there was a significant number of tweets from around Vastopolis Dome, Convention Center, and Vastopolis City Hospital.
Figure 1. Hourly tweet counts on May 18th - May 20th
Figure 2. Heatmap of tweet counts for May 18th at 8a.m.
log(c+1)
where c
is a tweet count in
a given cell of the heatmap grid. Figure 3 shows that, apart
from Downtown, part of Eastside and the banks of the Vast river get contaminated.
Hospitals in each of the burrows stand out as well.
Figure 3. Heatmap of a log of tweet counts for May 18th at 8a.m.
A plot of tweet log maps over 3 days shows how "sick" tweet distribution changes over time. We plotted
tweet log heatmaps for every day (Fig. 4 shows three last days of the data when disease really takes off). On May 18th, the disease started in the Downtown area and spread to the Eastside - possibly
carried by the strong wind from the West (as indicated in the Weather.csv
). Sick tweets started dispersing through the burrows in the evening after 5p.m. (evening commute from center to the suburbs) and were published at a steady rate throughout the night (see Fig. 1 above). On May 19th, we observed further disease spread in the westward direction due to the strong wind. However, there was a significant
number of "sick" tweets along the banks of Vast river downstream of infected Downtown. (We determined the direction of the river by the discharge on one of the dams).
May 18th
May 19th
May 20th
Figure 4. Daily logarithmic heatmaps of tweet counts related to sickness.
These two observations made us hypothesize that the disease may spread by air as well as by water. While evidence for disease being airborne is strong, we were puzzled to see that there was no increase in "sick" tweets around other bodies of water in the city (lakes Pasta, Bread, Rice, Disco, and Twin Lakes). However, it is better to be safe than sorry, so we conclude that both ways of disease spread are highly likely.
healthy | sick | % sick | |
before | 73929 | 4966 | 6.3 |
after | 67209 | 21678 | 24.4 |
Table 1. Tweet counts mentioning sickness vs. all other tweets before and after May 18th
Table 1 shows tweet count that contained references to sickness as well as "healthy" tweet counts before and after May 18th. A significant increase in "sick tweets" (24.4% of all tweets in 3 days) indicates that this new disease started an epidemic and its elimination may require drastic measures. Since its spontaeous onset on May 18th, the disease spread fast and infected a quarter of the population - all facts indicate that the disease is highly contagious and the outbreak can spread outside the city easily.
Symptoms and the method of transmission remind those of the West Nile virus: the virus is spread by mosquitoes, which could be carried by the wind, but also more prevalent near water.